

Synthetic experiments (R2, R4)

Neural Information Processing Systems

Teacher learning curve for Frozen Lake: the student return induced by the teaching policy at the end of the curriculum improves as CISR trains more students. For CISR, we evaluate a teacher policy trained with 30 students on new test students, while the Bandit baseline must explore and exploit anew for each student, since [27] cannot learn from previous students. Thank you for your helpful comments! Using multiple students enables CISR's key novelty: allowing the teacher to learn from previously supervised students. This makes CISR applicable, e.g., in a flavor of sim-to-real transfer where a curriculum policy is learned in simulation. Thus, we have at least 270 possible curricula. That CISR determines a good one after only 10 students attests to its learning ability.





Safe Reinforcement Learning via Curriculum Induction

Turchetta, Matteo, Kolobov, Andrey, Shah, Shital, Krause, Andreas, Agarwal, Alekh

arXiv.org Artificial Intelligence

In safety-critical applications, autonomous agents may need to learn in an environment where mistakes can be very costly. In such settings, the agent needs to behave safely not only after but also while learning. To achieve this, existing safe reinforcement learning methods make an agent rely on priors that let it avoid dangerous situations during exploration with high probability, but both the probabilistic guarantees and the smoothness assumptions inherent in the priors are not viable in many scenarios of interest, such as autonomous driving. This paper presents an alternative approach inspired by human teaching, where an agent learns under the supervision of an automatic instructor that saves the agent from violating constraints during learning. In this model, we introduce a monitor that neither needs to know how to do well at the task the agent is learning nor how the environment works. Instead, it has a library of reset controllers that it activates when the agent starts behaving dangerously, preventing it from doing damage. Crucially, the choice of which reset controller to apply in which situation affects the speed of agent learning. Based on observing agents' progress, the teacher itself learns a policy for choosing the reset controllers, a curriculum, to optimize the agent's final policy reward. Our experiments use this framework in two environments to induce curricula for safe and efficient learning.
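The teacher/student loop this abstract describes can be sketched in a few lines. The sketch below is a hedged illustration, not the paper's actual algorithm or API: the controller names, the `train_student` stand-in, and the epsilon-greedy teacher update are all illustrative assumptions.

```python
import random

# Sketch: the teacher has a library of reset controllers and, by observing
# the final reward of each student it supervises, learns which controller
# to apply at each stage of training. All names here (RESET_CONTROLLERS,
# train_student) are hypothetical.

RESET_CONTROLLERS = ["hard_reset", "soft_reset", "no_intervention"]

def train_student(curriculum):
    """Stand-in for training one student to convergence under the given
    per-stage reset controllers; returns the student's final reward."""
    # Toy model: safe hard resets early and light intervention late pay off.
    reward = 0.0
    for stage, c in enumerate(curriculum):
        helps_early = c == "hard_reset" and stage == 0
        helps_late = c == "no_intervention" and stage == len(curriculum) - 1
        lo, hi = (0.5, 1.0) if (helps_early or helps_late) else (0.0, 0.5)
        reward += random.uniform(lo, hi)
    return reward

def induce_curriculum(num_students=30, horizon=3, epsilon=0.2):
    """Epsilon-greedy teacher: per stage, keep a running average of each
    controller's contribution to the student's final reward."""
    value = [{c: 0.0 for c in RESET_CONTROLLERS} for _ in range(horizon)]
    count = [{c: 0 for c in RESET_CONTROLLERS} for _ in range(horizon)]
    for _ in range(num_students):
        seq = [random.choice(RESET_CONTROLLERS) if random.random() < epsilon
               else max(value[t], key=value[t].get)
               for t in range(horizon)]
        reward = train_student(seq)
        for t, c in enumerate(seq):  # credit every stage of the curriculum
            count[t][c] += 1
            value[t][c] += (reward - value[t][c]) / count[t][c]
    # Greedy curriculum: best controller per stage after all students.
    return [max(value[t], key=value[t].get) for t in range(horizon)]

curriculum = induce_curriculum()
```

The key structural point matches the abstract: the teacher never needs to solve the student's task itself; it only observes outcomes across students and adjusts which intervention it applies when.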


Learning Curriculum Policies for Reinforcement Learning

Narvekar, Sanmit, Stone, Peter

arXiv.org Artificial Intelligence

Curriculum learning in reinforcement learning is a training methodology that seeks to speed up learning of a difficult target task by first training on a series of simpler tasks and transferring the knowledge acquired to the target task. Automatically choosing a sequence of such tasks (i.e., a curriculum) is an open problem that has been the subject of much recent work in this area. In this paper, we build upon a recent method for curriculum design, which formulates the curriculum sequencing problem as a Markov Decision Process. We extend this model to handle multiple transfer learning algorithms, and show for the first time that a curriculum policy over this MDP can be learned from experience. We explore various representations that make this possible, and evaluate our approach by learning curriculum policies for multiple agents in two different domains. The results show that our method produces curricula that can train agents to perform on a target task as fast as or faster than existing methods.
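The curriculum-as-MDP formulation the abstract mentions can be illustrated with a small sketch: states summarize the student's current competence, actions select the next source task, and tabular Q-learning learns the curriculum policy. The task set, the competence model, and the state discretization below are illustrative assumptions, not the paper's actual formulation.

```python
import random

TASKS = ["easy", "medium", "target"]

def train_on(task, competence):
    """Stand-in for training on a source task; returns the updated
    (transferable) competence toward the target task."""
    gain = {"easy": 0.20, "medium": 0.35, "target": 0.15}[task]
    return min(1.0, competence + gain * random.uniform(0.5, 1.0))

def learn_curriculum_policy(episodes=300, alpha=0.3, gamma=0.9, eps=0.2):
    """Q-learning over (competence bucket, task) pairs; the reward is the
    competence gained, so the learned policy orders source tasks to reach
    target-task mastery quickly."""
    q = {}
    for _ in range(episodes):
        c = 0.0
        for _ in range(6):  # bound the curriculum length
            s = round(c, 1)  # discretize the curriculum-MDP state
            if random.random() < eps:
                a = random.choice(TASKS)
            else:
                a = max(TASKS, key=lambda t: q.get((s, t), 0.0))
            c_next = train_on(a, c)
            r = c_next - c  # reward: progress toward mastery
            s_next = round(c_next, 1)
            best_next = max(q.get((s_next, t), 0.0) for t in TASKS)
            old = q.get((s, a), 0.0)
            q[(s, a)] = old + alpha * (r + gamma * best_next - old)
            c = c_next
            if c >= 1.0:  # target task mastered; episode ends
                break
    return q

q_table = learn_curriculum_policy()
```

Note the design choice this mirrors from the abstract: because the curriculum itself is an MDP, a curriculum *policy* (rather than a fixed task sequence) can be learned from experience and reused across students whose competence evolves differently.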